Improving Classification Accuracy Assessments with Statistical Bootstrap Resampling Techniques
نویسنده
چکیده
The use of remotely sensed imagery to generate land cover models is common today. Validation of these models typically involves the use of an independent set of ground-truth data which are used to calculate an error matrix resulting in estimates of omission, commission, and overall error. However, each estimate of error contains a degree of uncertainty itself due to 1) conceptual bias, 2) location/registration and coregistration errors, and 3) variability in the sample sites used to produce and validate the model. In this study, focus was not placed upon describing land cover mapping techniques, but rather the application of bootstrap resampling to improve the characterization of classification error, demonstrate a method to determine uncertainty from sample site variability, and calculate confidence limits using statistical bootstrap resampling of 500 sample sites acquired within a single Landsat 5 TM image. The sample sites represented one of five land cover categories (water, roads, lava, irrigated agriculture, and rangelands) with each category containing 100 samples. The sample set was then iteratively resampled (n=200) and 65 sites were randomly selected (without replacement) for use as classification training sites while the balance (n=35) were used for validation. Imagery was subsequently classified using a maximum likelihood technique and the model validated using a standard error matrix. This classification-validation process was repeated 200 times. Confidence intervals were then calculated using the resulting omission and commission errors. Results from this experiment indicate that bootstrap resampling is an effective method to characterize classification uncertainty and determine the effect of sample bias.
منابع مشابه
Improving Coverage Accuracy of Block Bootstrap Confidence Intervals
The block bootstrap confidence interval based on dependent data can outperform the computationally more convenient normal approximation only with non-trivial Studentization which, in the case of complicated statistics, calls for highly specialist treatment. We propose two different approaches to improving the accuracy of the block bootstrap confidence interval under very general conditions. The...
متن کاملImage Registration Accuracy Estimation Without Ground Truth Using Bootstrap
We consider the problem of estimating the local accuracy of image registration when no ground truth data is available. The technique is based on a statistical resampling technique called bootstrap. Only the two input images are used, no other data are needed. The general bootstrap uncertainty estimation framework described here is in principle applicable to most of the existing pixel based regi...
متن کاملMethods for fuzzy classification and accuracy assessment of historical aerial photographs for vegetation change analyses. Part I: Algorithm development
Image classification of historical aerial photographs is very useful for the study of medium-to-long term (10–50 years) vegetation changes. To determine the quality of information derived from the classification process, accuracy assessment of the classification is implemented. Error matrix, which is primarily used in remote sensing for accuracy assessment, is typically based on an evaluation o...
متن کاملBootstrap resampling as a tool for radio - interferometric imaging fidelity assessment
We report on a numerical evaluation of the statistical bootstrap as a technique for radio-interferometric imaging fidelity assessment. The development of a fidelity assessment technique is an important scientific prerequisite for automated pipeline reduction of data from modern radio interferometers. We evaluate the statistical performance of two bootstrap methods, the model-based bootstrap and...
متن کاملAssessing Uncertainty in LULC Classification Accuracy by Using Bootstrap Resampling
Supervised land-use/land-cover (LULC) classifications are typically conducted using class assignment rules derived from a set of multiclass training samples. Consequently, classification accuracy varies with the training data set and is thus associated with uncertainty. In this study, we propose a bootstrap resampling and reclassification approach that can be applied for assessing not only the ...
متن کامل